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Abstract 

We study here the spectra of random lifts of graphs. Let G be a finite 
connected graph, and let the infinite tree T be its universal cover space. If 
Ai and p are the spectral radii of G and T respectively, then, as shown by 
Friedman |Fri03j . in almost every n-lift H of G, all "new" eigenvalues of 

H are < O yX^ 2 p^ 2 j . Here we improve this bound to O yX^ 3 p 2 ^ 3 J . It 

is conjectured in |Fri03j that the statement holds with the bound p + o(l) 
which, if true, is tight by [Gre95j. For G a bouquet with d/2 loops, our 
arguments yield a simple proof that almost every <i-regular graph has second 
eigenvalue 0(d 2 ^ 3 ). For the bouquet, Friedman [Fri] has famously proved the 
(nearly?) optimal bound of 2\/d — 1 + o(l). 

Central to our work is a new analysis of formal words. Let w be a formal 
word in letters g x x , . . . , g k 1 . The word map associated with w maps the 
permutations oj, . . . , <7 fe G S n to the permutation obtained by replacing for 
each i, every occurrence of in w by ov We investigate the random variable 
X^f 1 that counts the fixed points in this permutation when the o~i are selected 
uniformly at random. The analysis of the expectation K(Xw^) suggests a 
categorization of formal words which considerably extends the dichotomy 
of primitive vs. imprimitive words. A major ingredient of a our work is a 
second categorization of formal words with the same property. We establish 
some results and make a few conjectures about the relation between the two 
categorizations. These conjectures suggest a possible approach to (a slightly 
weaker version of) Friedman's conjecture. 

As an aside, we obtain a new conceptual and relatively simple proof of a 
theorem of A. Nica |Nica94j . which determines, for every fixed w, the limit 
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distribution (as n — > oo) ofxr. a surprising aspect of this theorem is that 
the answer depends only on the largest integer d so that w = u d for some 
word u. 

1 Introduction 

Let G = (V, E) be some fixed finite connected graph with E = {gi, . . . , g^\, 
and let Ai > A 2 • • • > \\v\ be the eigenvalues of its adjacency matrix. We 
think of the edges as being oriented, though the results do not depend on the 
orientation chosen. We recall that L n (G) denotes the probability space of 
n-lifts of G (i.e., graphs that have an n-fold cover map onto G). A graph H e 
L n (G), has vertex set V x {1, ... , n}. For every (oriented) edge gi = (u, v), 
we choose independently and uniformly a random permutation, cr, e S n , and 
introduce an edge between (u, j) to (v, &i(J)) for all j. For background on lifts 



and random lifts, see [LRM \MM\ IALM021 IHLW06] . In particular [BLl)6] 
shows how to construct regular graph lifts with a nearly optimal spectral 



The projection 7r : H — > G given by ir(u,j) = u is the cover map as- 
sociated with H e L n (G). Every eigenfunction / of (the adjacency matrix 
of) G is pulled back by n to an eigenfunction / o 7T of H. Therefore, ev- 
ery eigenvalue of G is also an eigenvalue of H. Such an eigenvalue of H is 
considered "old" while all other ones are "new" (including, possibly, dupli- 
cates of the old eigenvalues corresponding to "new" eigenfunctions). Let T 
be the (infinite) universal cover of G (and H). We consider l 2 (V(T)), the 
Hilbert space of square-summable real functions on T's vertices, i.e. the 
functions / : V(T) — > R with YlveV(T) f 2 ( v ) < 00 • ^ At is the (infinite) 
adjacency matrix of T, we consider the linear operator corresponding to At- 
Namely, f —> g where g(v) = Y^u~vf( u )- ^ s a bounded linear operator 
on l 2 (V(T)), and we denote its spectral radius by p = p(T) = p(G). (Recall 
that, by definition, p = sup{\u\ : v is in the spectrum of At})- Our main 
result is a new bound on if's new eigenvalues in terms of Ai (the largest 
eigenvalue of G), and p. 

Let pmax '■= max{\p\ : p, is a new eigenvalue of if}. In |Fri03j . Fried- 
man showed that p ma x < A] 1 /? 1 / 2 + o n (l) for almost every H. We improve 
this bound as follows: 

Theorem 1. Almost every random n-lift H of G satisfies: 



gap. 
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More Specifically, 

/W < max ^1, 3 (j^J ' j • A/V 73 + e (1) 
almost surely for every e > 0. 

Remark 2. Let T be a (not necessarily finite) connected graph, and let v be 
a vertex in V. We denote by t s (v) the number of closed paths of length s 
that start and end at v. It is well known that the spectral radius of F equals 
limsup^^ t s (v p s . In particular, this value is independent of the choice of v. 
(These facts may be proven by an easy variation on the proof of Proposition 
3.1 in |Buc86j .) Returning to our notation, observe that a path that starts 
and ends at a vertex v G V(T) is projected to a path of the same length 
that starts and ends at the corresponding vertex of G. Consequently, p < Ai 
always holds. 

Our proof of Theorem [I] suggests an approach that may lead to an even 
better (nearly optimal) bound p max < O (p). This plan depends on an un- 
resolved conjecture that we present shortly. It follows from Lubotzky and 
Greenberg [Gre95j that this statement cannot hold with any bound smaller 
than p — o(l). It is shown in [Gre95j that for every infinite tree T and for 
every e > 0, there exists a constant c = c(e, T) > 0, such that if T is the 
universal covering space of a finite graph T, then at least c|V(r)| of T's eigen- 
values exceed p(T) — e. Thus for G fixed, and for e > there exists an n € 
such that p m ax(H) > p{G) — e for every n > n e and every H e L n {G). (Since 
the infinite (i-regular tree Td has spectral radius p(T^) = — 1 ( |Car72j ). 
this extends the Alon-Boppana bound |Nil91] that A2 > 2y 'al — 1 — o n (l) for 
every n- vertex (i-regular graph). 

The "permutation model" of random rf-regular graphs (for al even) is 
a special case of random lifts of graphs. In the permutation model, n- 
vertex <i-regular graphs are generated through a random n-lift of a bou- 
quet of d/2 loops. Thus, our result, as well as Friedman's, extend earlier 
work on random (i-regular graphs. Namely, Friedman's result states that 
\{G) < v / 2civ / ^I + o(l) for almost every ci-regular graph, which is a slight 
improvement of an old result of Broder and Shamir [BS87]. In this special 
case, Theorem [1] states that \{G) = 0(d 2 ^ 3 ) holds almost surely, and the ten- 
tative proof strategy mentioned above would yield A(G) = 0(d 1 ^ 2 ) almost 
surely. In particular, we obtain the following corollary (which is, of course, 
substantially weaker than the one proven in |Frij ): 



3 



Corollary 3. If G is d-regular and d > 107, then 

fi max <\ 1 1/3 p 2/3 + e=Md-l)} l/3 + e. 
almost surely for every e > 0. 

A major tool in this area is the Trace Method which goes back to Wigner 



Wig55| . It is based on a natural connection between graph spectra and 
word-maps. This approach underlies the work of Broder-Shamir [ [BS8 7] and 
of Friedman |Fri03] . 

Let w be a (not necessarily reduced) formal word in the letters gf 1 , . . . , g^ 1 . 
For every fe-tuple (a±, . . . , a k ) of permutations in S n , we form the permutation 
w((Ji, . . . , (Tfe) G S n , by replacing g 1 , . . . , g k with o~x , . . . , a k in the expression 
of w. For instance, if w = 929\9i ~ #3> then u>(o"i, a 2 , 03) = a 2 o-^a 2 l a 3 . The 
correspondence between w and the permutation w(ai, . . . , cr^) G S n is called 
a word map. Such maps can be evaluated in groups other than S n as well (we 
refer to this briefly in Section [2]) . The study of word maps has a long history 
in group theory (see |LSh07] and the references therein). Our perspective is 
mostly combinatorial and probabilistic. 

For fixed formal word w we denote by a random variable on 
which is defined by: 

Xj™\ai, . . . , a k ) = # of fixed points of w(a x , . . . , a k ). (2) 



Now let H be an n-lift of G and let Aq, Ah be the adjacency matrices 
of G, H resp. We denote by /i a running index for the "new" eigenvalues of 
H . For every t > 1, the trace of A H l equals the number of closed paths of 

length t in H. This number can also be expressed as flj + ($^1=1^' <Vj • 
Therefore, for t even we obtain: 

/ \V(G)\ \ \V(G)\ 

^ < E ^ = E a*' + E v - E v = tr (^) - tr ( A c ) 

\ /i i=l / i=l 

Every closed path in if is a lift of a closed path in G. Since the edges 
of G are labeled gi,...,g k , every (closed) path in G corresponds to some 
formal word w in gf 1 , . . . ,g k hil . The closed lifts of this path are in 1 : 1 
correspondence with the fixed points of w(ai, . . . , a k ), so that their number 
is Xw((Ti, . . . , a k ). Let CVt{G) denote the set of all closed paths of length t 
in G (i.e., \CVt(G)\ = tr{Ao)). The above inequality now becomes: 
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n 



(cti, ...,a k ) 



w€CVt(G) 



Taking expectations, we obtain: 



1] 



(3) 



w&CP t (G) 



Equation (j3J) shows the significance of E(JQT ) — 1 in the study of spectra 



where the Oi(iu) are integers. (This fact appears in |Nica94j . but we present 
(Lemma Hj) a new and simpler proof). This induces a categorization of words 
in Ffc, the free group with generators gi, . . . , g k . Namely, 4>(w) is the smallest 
index i for which a,i(w) ^ 0, or oo if E(xL n) ) = 1. 

We consider next (Section 12.21) another categorization of the words in 
Ffc, which does not depend on a word-map to specific groups such as S n . 
To every to 6 Ft we associate (3{w) which is a non-negative integer or oo. 
The categorizations induced by both 4>(w) and (3(w) extend the dichotomy 
between primitive and imprimitive words (Recall that w is called imprimitive 
if w = u d for some u G F& and d > 2). Without going into the (somewhat 
lengthy) definition, let us say that the main step in both [BS87j and [Fri03j 
can be viewed as the observation that for i — 0, 1, <j){w) = i iff (3{w) = i. 
Our aforementioned conjecture states in this language that <f>(w) = P(w) for 
every word w (Conjecture [15]). These relations between and @ allow us to 
bound the sum in the r.h.s of We can bound the contribution of w to 
this sum in terms of <p(w). This is complemented by bounding the number of 
words w G CVt(G) with a given value of (3(w) which bound is stated in terms 
of p. Indeed, a key step in the present paper (Lemma [20]) can be interpreted 
as a partial proof of the claim that (3(w) = 2 iff (f)(w) = 2. 

As an aside to our work we obtain a new conceptual and relatively simple 
proof of a theorem of A. Nica |Nica94j . which determines for every fixed w the 
limit distribution of x4 as n — > oo (see Theorem [231) . We carry out a similar 
analysis for all higher moments of and use the method of moments to 
derive Nica's result. A surprising aspect of this theorem is that the limit 




, it turns out that for every 




i=0 
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distribution depends only on the largest integer d such that w = u d for some 
u G Ffc. Nica's full result (which we derive by the same argument) concerns 
not only fixed points but applies just as well to the number of L-cycles for 
any fixed L > 1. 

The paper is arranged as follows. We begin (Section EJ) with our analysis 
of word maps and introduce the two new categorizations of formal words. 
Based on this analysis, we prove Theorem [T] in Section [3J In Section H] we 
deal with the distribution of the number of L-cycles in w(a\, . . . , a^) and 
present our new proof for Nica's Theorem. For the reader interested only 
in this new proof, this section is mostly self-contained with only occasional 
references to earlier parts of the paper. There are numerous open problems 
and conjectures raised in this paper, some of which we collect in Section 

2 Word Maps and the Level of Primitivity of 
a Word 

We begin with some notation. We denote by the set of all finite words in 
letters gf 1 , . . . , g^ 1 (though we occasionally use the letters a, b, c, . . . instead). 
The quotient of modulo reduction of words is the set of elements of the 
free group on k generators. For instance, the set CV t (G) introduced before 
Equation © is a subset of so it may contain different words which are 
equivalent as members of F&. 

For every group P and every word w G the word map w : P k — > P 
is defined by substitutions and composition. For pi, . . . ,pk G P, the element 
w(pi, . . . ,Pk) is obtained by substituting pi for each occurrence of ^ in w 
and this for every 1 < % < k. Clearly, the word map of w is invariant under 
reductions, so we can regard w as an element in F&. 

Most research on word maps concerns the range of certain fixed words w 
in a group P. More specifically, for P finite, it is of interest to understand 
the distribution induced on P by the word map w : P k — > P and the uni- 
form distribution on P k . This perspective makes it natural to consider an 
equivalence relation on words (beyond that of reduction). 

In order to introduce this equivalence relation, we now recall some simple 
terminology from combinatorial group theory. There are three elementary 
Nielsen transformations defined on the free group F^: (i) Exchanging any 
two generators g>j and gj for some i j, (ii) Replacing some g i with g^ 1 , 
(iii) Replacing any g>j by g^gj, for some i ^ j. We recall (e.g. [MKS66J, 
Theorem 3.2) that these transformations generate the automorphism group 
Afc of Ffc. We say that two words Wi,w% G F& are equivalent, and denote 
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W\ ~ W2, if they belong to the same orbit of A*.. Obviously, " ~ " is an 
equivalence relation. It is quite clear that for every finite group P, every two 
equivalent words u>i,u>2 G induce the same distribution on P. We do not 
know whether the converse is true as well, and we state a specific problem 
(Conjecture [T71) in this vein. 

Given a word w and the distribution it induces on a group P, it is of in- 
terest to consider how far this distribution is from the uniform distribution. 
The two gradings of words </>(■) and /?(•) can be viewed as our attempts to 
capture this intuition. Both parameters associate a non-negative integer or 
00 with every w G Ffc, and they tend to grow as the above-mentioned dis- 
tance decreases. Of course, the distribution furthest away from the uniform 
distribution corresponds to the word w — 1. Indeed, (3{w) = iff <f>(w) = 
iff w = 1 (Lemma [T2"]) . Also, (3(w) = 1 iff <f>(w) = 1 iff w is imprimitive 
(Lemma fT3l) . In this case the range of w contains only powers of certain 
exponent. (Recall that w G is called imprimitive if w = u d for some word 
u and d > 2.) At the other end of the scale, both (3(w) and 4>{w) equal 00 
for words that are ~-equivalent to a single-letter word. Clearly, such words 
always induce the uniform distribution on P. Another important property 
is that both (3 and <fi are invariant under " ~ " . 

The definition of <f)(w) depends on the word map for the symmetric group 
S n (see Section I2TTT) . The definition of /3(w) is more involved and is based 
on a certain analysis of w as a formal word without reference to groups (see 
Section |2~21) . In fact, we have arrived at our definition of (3{w) through our 
study of <p{w). Some proven results and extensive numerical simulations 
suggest that 4>{w) = (3{w) for every w (Section l2.3p . One advantage of the 
parameter f3 over is that we can bound the number of words with fixed 
value of (3 - see Section 13.11 

Since both <p{w) and f3(w) offer an extension of the primitive-imprimitive 
dichotomy for words, we tend to think of them as quantifying "the level of 
primitivity" of a word (this level is if w — 1, it is 1 if w is imprimitive, and 
> 2 for primitive words - see Section l2~3l) . 

2.1 The Word Map w : 5 n * S n and 

In this section we present a method to calculate the expectation of the 
number of fixed points in w(ai, . . . , o~k) (defined in (jSJ)). We count the fixed 
points in w(ai, . . . , 0^) for all fc-tuples (<7i, . . . , o&) G and divide by (n!) fc . 
This calculation is carried out through a certain categorization of all fixed 
points. We note that similar considerations appear in |Nica94j and in [Fri91] . 
We begin with some technicalities. Let w = g^g^ 2 ■ ■ ■ g°^ G Ejt, where 
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h,---Jm e {1, . . . , k} and a u . . . , a m G {-1, 1}, and let ax,...,a k G S n . 
Assume that s G {1, . . . , n} is a fixed point of w(ai, . . . , cr k ). Associated 
with so is the following closed trail: 

a-r ao OLrt a m—l nvrJ 

a ■ a- (T- cr- cr. m 

so — > Si — > s 2 — > . . . — ► S m _i — ► S 

with si, . . . ,s m _i G {1, . . . ,n}, and s b mod m = a™ b {s b „ x ) {b = 1, . . . ,m). 

Note that for the sake of convenience, we compose permutations from left 
to right. This is inconsequential for the analysis of the variables X^f 1 since 
w(ai, . . . , crfc) with left-to-right composition is the inverse of w((t{~ , . . . , cr^T 1 ) 
with right-to-left composition, and thus both have the same cycle structure. 

We categorize fixed points according to their associated trails. Let s — > 
. . . — > s m _i — ► s and s' Q — > . . . — > s' m _ 1 — > s be the trails of the fixed points 
so and s in w(ai, . . . a k ) and w(a[, . . . , a' k ) respectively. These two trails are 
placed in the same category, if they have the same coincidence pattern, that 
is, if for every i,j G {0, . . . , m — 1}, Sj = Sj s' { = s'j. 

Each closed trail consists of m integers, or points, possibly with repe- 
titions, and each category of trails uniquely corresponds to some partition 
of these m points. Consequently, there are at most B(m) categories, where 
the m-th Bell Number, B(m), is the number of partitions of an m element 
set. This bound is, however, not tight. For instance, if s a+ i = crj(s a ) and 
Sb + i = a~ (s&), then s a = s b+1 <^ s a+1 = s&. Therefore, not every partition 
corresponds to a realizable category of trails. 

It is convenient to associate a directed edge-colored graph T with each 
category. Vertices in T correspond to blocks in the partition that defines T's 
category. In other words, T has as many vertices as the number of distinct 
integers among the s&'s. There is a directed edge labeled j from one vertex 
(=block) to another, whenever the trails include an arrow labeled aj (resp. 
o~f l ) from a point in the first (second) block to a point in the second (first) 
one. 

Of special importance is the graph associated with the finest possible 
partition which we call the universal graph. Two points in the trail are merged 
in this partition if and only if they are merged in every realizable partition. 
If w is cyclically reduced (i.e., no two consecutive letters are inverses, nor are 
the first and last letter), this is the partition where all m points in the trail 
are distinct. To illustrate, we draw in Figured] the universal graph of three 
different words. 

All other graphs are now easily derived as quotients of the universal graph, 
or partitions of its vertices. A quotient graph has one vertex per each block 
in the partition. It has a j-labeled directed edge (j'-edge for short) from block 
V\ to block V2, if the universal graph contains a j-edge from a vertex in v\ to 
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lA 



Figure 1: From left to right: the universal graphs of w = 9\929\ #2 1 (the 
commutator word), oiw = 9\929292~ X 9i929{ X , and of w = gi9i 1 929^9^ 92 1 ■ 



a vertex in v 2 - A quotient is not realizable if it contains two distinct j-edges 
with common head and different tails or vice-versa. We denote by Q w the 
set of all realizable quotients . 

To illustrate, we draw (Figure [2]) all the realizable quotient graphs of 
the universal graph of the commutator word (one of the graphs in Figure 
[T]). Note that a four element set has 15 partitions (the fourth Bell number, 
.8(4) = 15), of which only 7 are realizable in this case. 

These graphs suggest a simple formula for the number of fixed points in 
each category. Let i>r, (er) be the number of vertices (edges) in the graph 
T, and ef be the number of j-edges (and so er = X]j=i e r)- To count the 
number of fixed points in T's category, or the number of realizations of T, we 
first label T's vertices by distinct numbers from {1, . . . , n} (i.e., specify the 
values of so, ... , s m -i) . This can be done in n(n — 1) . . . (n — t>r + 1) ways. 
For each j = 1, . . . , k there are (n — ef )! permutations that are consistent 
with the ep values in the permutation <jj that are already determined. Thus, 
the number of realizations of T is: 

k 

N r (n) =n(n-l)...(n-v r + l) J[ [n - ef] ! (5) 

3=1 

A formula for the expectation of is now at hand: 
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Figure 2: The 7 different graphs in Q w , the set of graphs of categories of 
fixed points when w = g\gig\ g% • Each graph is drawn together with the 
partition of the universal graph which yields it. (We do not specify the block 
corresponding to each vertex). 



= W E ^W..,^) = 7^£iVr( 
v ' ; ai,...,<T h eS n v ' ; reQ™ 

n{n — 1) . . . (n — + 1) 



r£n^i^-l)---("-4 + l) 

(The third equality holds only for n that is > ef for all j and T.) 

We illustrate these calculations for w the commutator word gig^i 9% ■ 
If we go over the graphs in Figure [5] in clockwise order starting at the upper- 
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left graph, © becomes: 



(n) n{n - T){n - 2){n - 3) n(n-l)(ra-2) 

1 wJ [n(n- l)][n(n- 1)] [n(n - l)][n(n - 1)] 

nfn — 1) n(n — 1) n 

H 1 1 1 h 

[n(n — l)][n] [n][n(n — 1)] \p\\p\ 

n(n — 1) n(n— l)(n — 2) n 



[n(n — l)][n(n — 1)] [n(n — l)][n(n — 1)] n — 1 



In (j3J) we defined <& w (n) to be 1V^£_L_L. The following lemma states an 
important property of 



Lemma 4. Associated with every w G zs a power series Y2iLo a i( w ) xl 
that has a positive radius of convergence where the coefficients di(w) are 
integers. In particular, for every w and for every sufficiently large n, there 
holds $ w {n) = Y,T=Q a i( w ) hi- 
proof. As we saw in OH]), for large enough n (e.g., n> \w\ suffices), 

1 ^yr-vr+l _ ^ 



Since every T is a connected graph, we have er — v r + 1 > 0. The lemma 
follows when we individually consider the expression corresponding to each T. 
(See the proof of Lemma for a thorough analysis of these expressions) . □ 



By Lemma HI the contribution of every w G CVt(G) in (jHJ), is 



a,i(w)+o(l) 



i i— 1 



for some nonnegative integer i. This induces the following useful grading of 
words: 



the smallest integer i with a,i(w) ^0 ifE(xi n) )^l 



Recall that although the construction of <& w depends on the actual repre- 
sentation of w (a reduction of w usually changes Q w ), this function captures 
some features of the distribution of the image of w on 5„. Thus $ w , as well 
as (f>(w), are invariant not only under reduction, but also under " ~ ". 

With this new terminology we can reinterpret both [BS87 and [Fri03j 
as follows: Both papers rely on the fact that <f>(w) — iff w reduces to the 
empty word, and that 4>(w) = 1 iff w is imprimitive (see Lemmas H2] and 
T3l) . To study the new spectrum of random lifts, one proceeds as follows: 
The number of words of these two kinds can be bounded in terms of p (the 
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spectral radius of the universal cover) alone (and does not depend on Ai, the 
spectral radius of the base graph). Finally, the rest of the words (which are, 
in fact, the vast majority) contribute to the summation in ([3]) only O (-) 
each. 

Here we extend these ideas and seek (with partial success) a similar char- 
acterization for all words with 4>(w) = i for fixed i. Our analysis of the new 
spectrum extends these arguments and refines them. We further split the 
above-mentioned third set and attain an improved bound on the contribu- 
tions of these subsets to the sum in Equation ([3]). 

2.2 More on the Level of Primitivity: f3{-) 

We now start our second attempt at capturing the "level of primitivity" of 
w by means of the parameter (5{w). We begin with some definitions. 

Recall the notion of a trail from Section 12.11 Consider then the following 
trail through w = g^g,^ 2 ■ ■ ■ g^ (this time, the Sj's should not be thought 
of as numbers but rather as abstract symbols, and the trail deliberately ends 
with s\ w \ and not with s ): 

°1 «2 «3 g"™" 1 g a ™ 
9i 1 9i 2 9i 3 Vl-i M 
SO ► Si > S 2 > . . . > S| w |_x ► S| w | 

As in our former definition of a realizable category of trails, we say that 
a partition of {sq> • • • , s \w\} is realizable if the following conditions hold: 
Whenever ih = %i and ah = ai, Sh-i is in the same block with si-i (we 
denote Sh-i = S;_i) iff Sh = S[. Likewise, whenever i h = i\ and = —ai, 
s h _i = Si Sh = Si-i. (In other words, a partition is realizable whenever 
it traces a trail of some point, fixed or not, through u>(<7i, . . . , <Jk) fc> r some 
ai, . . . , <jfc G S n and some n.) 

As before, to each realizable partition of {so, . . . , s\ w \} corresponds a di- 
rected edge-colored graph T, which is a quotient of the graph of the trail. 
According to our former notation, T G Q w whenever s = s\ w \. We now 
concentrate on the number of pairs of Sj's that should be merged in order to 
yield a specific Y. 

Definition 5. Let V be the quotient graph corresponding to some realizable 
partition of {s , . . . , s\ w \}. We say that set of pairs {{sj t , s^}, . . . , {sj r , Sk r }} 
generates T, if V corresponds to the finest realizable partition in which 
Sji = s ki Vi = 1, . . . ,r. 
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Example 1. Let us return to the commutator word w = g\g29\ X 9i 1 - The 
trail here is 



91 92 9\ 1 9 2 1 
So — > Si > S 2 — > S 3 > S 4 . 



In Figure [3] we revisit the seven quotient graphs from Figure [2] (the seven 
graphs in Q w ), and specify a smallest generating set for each of them. 




{{s ,s.}} 




[s ,s 4 j, is i ,s 3 j 




2 2 




1 2 




1 2 

GO 



[s ,s,j, {s o ,s 2 j 



[s ,s 2 j, [s r s 3 ), [s o ,sj 



Figure 3: A smallest generating set of each of the seven graphs in Q w , the 
set of graphs of categories of fixed points when w = <7i<?2<7i ~ 9% ■ 



We denote by x(T) = er — v r + 1 the Euler characteristic of T. It turns 
out that there is a tight connection between xCO and the smallest size of a 
generating set of T: 

Lemma 6. Let T be the quotient graph corresponding to some realizable 
partition of {so, . . . , s\ w \}. The smallest cardinality of a generating set for T 

is x(r). 

Proof. It is quite easy to construct a set S of x(T) pairs that generates V. 
To this end, we adopt the original terminology of [BS87J. As we follow the 
path of w through T, each move has one of three types. In a free step 
we traverse a new edge and reach a new vertex, and so one vertex and one 
edge are added to the partial graph. In a coincidence a new edge leads 
us to an "old" vertex, so we gain one new edge and no new vertices. In a 
forced step we traverse an old edge (necessarily to an old vertex), so the 
numbers of vertices and edges remain unchanged. Consequently, equals 
the number of coincidences in this walk. We introduce into S one pair for 
each coincidence. If the j-th step is a coincidence in which we reach a vertex 
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in T representing the block s^, . . . , s ir (i± < . . . < i r < j), we add {sj, s^} 
to S. Clearly, the cardinality of S is x(r) and it generates T. 

To see that T has no generating set smaller than S, consider S, the 
collection of all generating sets of T of the smallest possible cardinality. We 
claim that S is the lexicographically first member of S. Concretely, write each 
pair under consideration as {si,Sj} with i > j. Now sort the pairs in each 
S G S in increasing lexicographic order and let T G S be the lexicographically 
first member of S. Our claim is that T — S. Observe that if {-Si,Sj} G T 
then there is no index h < i with h ^ j with {si,Sh} G T. Otherwise we 
could replace the pair {si,Sj} with the pair {sh,Sj} and generate the same 
quotient as does T with a lexicographically smaller set of pairs. 

It is helpful to consider for each 1 < % < \w\ the graph Tj that is the 
quotient of s — > . . . — > Si generated by the initial segments of T that includes 
only those pairs in T where both indices are < %. We claim that is a 
subgraph of Tj for all %. If there is no pair in T where is the larger member, 
this is clear. If {si,Sj} is in T then the index j is uniquely defined by the 
above remark. In this case we need to show that the identification of Sj 
with Sj does not entail any additional identification (which would violate the 
inclusion C Tj). How can such an identification occur? Only if the label 
of the edge agrees with that of an edge e incident with the block 

that contains Sj (and has the correct orientation). Let s u be a member of the 
block at the other end of e. Clearly v < i. But now again we can generate 
the quotient generated by T by a lexicographically smaller set, i.e., replace 
the pair {sj, Sj} by {s^, s u }. 

We can again recognize the three types of steps by observing how the 
graphs Tj grow at each step. If Tj stays unchanged, this is a forced move. If 
only an edge is added, this is a coincidence and in a free step one vertex and 
one edge are added. It is exactly at each coincidence step that vertices at 
T get merged. But this is precisely what we did in constructing S, so that 
S = T, as claimed. 

□ 

The following categorization of the quotient graphs in Q w turns out to 
be very useful: 

Definition 7. Let w be a word in S fc . We say that a quotient graph T G Q w 
has type A, if one of the smallest generating sets for F contains the pair 
{so,s\ w \}. Otherwise, we say T has type B. 

Given a word w, we classify the graphs in Q w according to their charac- 
teristics and type. Note that x(T) < since every T has at most \w\ edges. 
We illustrate this again with the seven graphs of the commutator word: The 
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figure-eight graph with one vertex and two edges has type B. The other six 
graphs have type A (their generating sets specified in Figure [3] purposely 
include {so,S4}). Table Q] shows the whole census. 



X 





1 


2 


3 


4 


Type 


A 





1 


4 


1 





Type 


B 








1 









Table 1: The number of graphs in Q w in any specified character and type, 

when w = gmg^g-T 1 - 

We are now ready for the second definition for a word's "level of primitiv- 
ity". Let wbea word in S fc . We define (3{w) to be the smallest characteristic 
of a type-B graph in Q w . Namely, 

/3(w) := min ({x(r) : T E Q w and T has type B} U {oo}) (9) 

Example 2. For the commutator word /3(w) = 2. 

In the next few lemmas we establish several properties of /?(•) which 
are clearly desirable. Among others, /?(•) is proved to be invariant under 
reductions, and hence well defined as a function on 

Lemma 8. /?(•) is invariant under cyclic shifts. 

Proof. Let w G and let w' be some cyclic shift of w. There is an obvious 
bijection between Q w and Q w >, obtained by applying the appropriate cyclic 
shift on the indices of the Sj's in the blocks' names. (E.g, if w' is attained 
from w by a right cyclic shift of two positions, replace each label s» in w by 
s'j in w' where j = i + 2 mod \w\.) We claim that in addition, each V G Q w 
has the same type in Q w as its matching quotient V in Q w >. It suffices to 
show that if V has type A in Q w , then T' have type A in Q w r (It then follows 
by symmetry that T has type A in Q w iff T' has type A in Q w i). 

Let S be a smallest generating set of Y with {s , S| M |} G S*. Given S, we 
can generate T gradually, through a series of quotients. To proceed from the 
quotient Tj to the next quotient, we add a pair {sj,s r } G S. To determine 
r i+1 we carry out all necessary identifications and only them. Formally, r i+1 
is the finest realizable quotient of Tj in which the pair {sj, s r } is merged. It is 
easily verified that the final quotient is T regardless of the order at which the 
pairs in S are introduced. But merging {so, s\ w \} in w, and merging {s' , sj^} 
in w', yield equivalent quotient graphs (these are the universal graphs of w 
and of w', as defined in Section |2~D around Figured]). We can now proceed 
by applying the same series of quotients as described above on both universal 
graphs, to conclude that T' is of type A with respect to w' as well. □ 
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Lemma 9. Let w G and let w' be its reduced form. Then (3{w) = (3{w'). 

Proof. We need to show that (3 does not change when a letter and its in- 
verse are inserted consecutively into a word. But Lemma El says that j3 is 
invariant under cyclic shifts, so it suffices to show that j3(w) = f3(w') for 
w = g^gg* . . . g£r and w' = 9i ?gg> . . . /C // ; // ; ; . To see this, we define 
below for every quotient T G Q w the subset e(r) C Q w > of all consistent 
extensions of T. The set e(T) contains a certain member S(T) which plays a 
special role. The relevant properties of e and 5 are: 

• The union of the images of e is all of Q w >. 

• If Ti 7^ T 2 G Q w , then e(Ti) and e(T 2 ) are disjoint. 

• If T G Q w has type A, then all members in e(T) have type A. 

• For every V G Q w , one graph 5(T) G e(T) has the same Euler charac- 
teristic and the same type as T. All other members in e(T) have Euler 
characteristic x(r) + 1. 

It should be clear that these properties prove the lemma. 

If v G V(r) is the vertex corresponding to s m , then e(T) is the set of all 
extensions of T G Q w where there is a j-edge (an edge labeled j) starting at 
v . If T already has such an edge, then we can attain a graph T' G Q w > by 
adding s m+2 to the block containing s m , and adding s m+ i to the block at the 
end of this j-edge. We then define e(r) = {5(T)} = {V} w {r} (We use 
to denote equality as vertex- unlabeled-graphs.) 

Otherwise, e(T) includes all the (vr — ef + 1) different possible extensions 
of r with such an edge. We only need to specify the other vertex of this new 
edge, that corresponds to s m+ i. In the graph 5(T) the vertex s m+ i is new and 
so is the j-edge (s m , s m +i). Clearly, x(5(r)) = x(T), as claimed. Otherwise 
this additional edge can go from s m to any of the — ef vertices in T which 
are not tails of a j-edge. Such graphs clearly have characteristics x(H + 1. 

We prove the first two properties of e by recovering, for every T' G Q w > 
the (unique) graph T G Q w with V G e(T). We consider the (m + l)-st 
step in the path of w' through T' (the step from s m to s m +i), and use the 
notations of Lemma El If this step is free, then T is obtained from T' by 
deleting the vertex corresponding to s m+ \ and the edge (s m ,s m+ i) (as well 
as, of course, omitting s m+ 2 from its block). If it is a coincidence, then clearly 
r«r' \ (s m , s m+ i). Otherwise, it is forced and so T m V. 

We want to show next that if V G Q w has type A, then all graphs in e(T) 
have type A as well. By assumption F is generated by a set S of cardinality 
\S\ = x(r) and {s , s m } G S. Note that s m = s m+ 2 in every quotient of w'. 
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Therefore, when we consider generating sets for graphs in Q w >, the vertices 
s m and s m+2 play the exact same role. We therefore define 5" to be the 
set of pairs that is attained by replacing each occurrence of s m in S with 
s m+ 2- Clearly, 5" generates the graph S(T) e e(T). For any other graph in 
e(r), we add to S' the pair {s m+ i,Si} where Sj (i < m) corresponds to the 
vertex which (s m , s m+ i) goes to. In each of these cases we found a smallest 
generating set 5" that includes the pair {s ,s m+2 }, so all members of e(T) 
have type A. 

Finally, we need to show that if T 6 Q w has type B, then so does 5(T). 
So suppose S(T) has type A, with a smallest generating set S' that contains 
the pair {s ,s m+2 }. Let us construct a set of pairs S by replacing each 
occurrence of s m+2 in S' by s m . If S' contains some pair {s m+1 ,Sj}, then 
clearly s m+ \ is not a new vertex in 5(T), and we are necessarily in the case 
where T pa 8(T) (recall that T and S(T) have the same characteristic). In 
this case the edge (s m , s m+ i) is not new, so it is merged with some (s r _i, s r ) 
(or (s r+ i,s r )) for some r < m. Thus, we can replace each s m+ i in S' with 
s r . This is a contradiction since S is a smallest generating set of T which 
therefore has type A. □ 

Note that from Lemmas El and [9] it follows that f3 is invariant under cyclic 
reduction as well, or under conjugation. Similar arguments show that it is 
also invariant under the equivalence relation " ~ ", but we do not include 
the proof. In the following lemma we state an important property of type-B 
quotient graphs. This property plays a crucial role in the sequel, where we 
introduce a bound to the number of words with some fixed value of /?(•). 

Lemma 10. Let Y G Q w have type B. As we trace the path of w through T, 
every edge in T is traversed at least twice. 

Proof. We show that if some edge e is traversed only once, then T has type 
A. Lemma M allows us to assume that e is the last step in the path of w, i.e. 
the step from S| TO i_i to s\ w \. In the proof of Lemma EJ we constructed S, a 
generating set of T of smallest cardinality, with one pair for each coincidence 
in the path of w through T. Here, the last coincidence corresponds to the 
pair {so,S|„|}. Therefore T has type A. □ 

Remark 11. The converse is not true. There are quotients of type A where 
every edge is traversed more than once. Consider the word w = ababa. One 
of the quotient graphs in Q w is a figure-eight with one vertex and two loops. 
Each edge in this quotient is traversed twice or thrice, but the quotient has 
type A. It is generated by the two pairs {sq, s 2 }, {sq, s^}. 
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2.3 Some Connections between (f)(w) and f3(w) 

We now turn to examine the relation between and f3. We first observe that 
a,i(w) (the coefficient of in the power series form of $ w (n)), is completely 
determined by quotients in Q w with characteristic < i. This is easily verified 
by considering the contribution of each T G Q w in ([7]). We are now able to 
use our new perspective and show that (as mentioned above) for i = 0,1, 
(j)(w) = i -v=> (3{w) = i. 

Lemma 12. For every w G <p{w) = <^=> f3(w) — 43> w = 1. 

Proof. By Lemma El the only quotient graph of w with characteristic is the 
graph T generated by the empty set. Now <p{w) = ao(ti>) >0 T G 
*"> so = si w | in T w reduces to 1. If T G then T has type B by 
definition. Thus (3(w) = T G Q w . □ 

Lemma 13. For every w G F k , <p(w) = 1 /3{w) = 1 ^ w is imprimitive. 

Proof. Let w G be in reduced form and assume w ^ 1. By Lemma [T^| all 
T G Q„, have a positive characteristic. The definition of <& w {n) clearly yields 
thatax(w) = |{r G Q w : x(r) = 1} [ — 1 . In this case the single pair {s , s\ w \} 
is a smallest generating set for the universal graph (defined in Section I2TTI) . 
so at least one quotient has characteristic 1. Obviously, any other quotient 
with x = 1 is not generated by {s , s\ w \}, and thus (by Lemma|6} has type B. 
Thus 4>{w) = 1 -v=> (3{w) = 1 -v^ such additional quotients exist. We complete 
the proof by showing that the latter is true iff w is imprimitive. 

Let w' be the cyclic reduction of w. It is easy to verify that w ~ w' 
whence <fi(w ) = cf)(w') and w is primitive iff so is w'. Lemmas M and [9] yield 
that /3(w) = /3(w') as well. Thus we can assume, for simplicity, that w is 
cyclically reduced. 

In this case, merging sq and s\ w \ implies no other identifications, and the 
universal graph is a cycle of length |iw|. If w is imprimitive, there is some 
u G Ft and d > 2 such that w = u d . Clearly, u is cyclically reduced as well, 
and the universal graph of u is a cycle of length \u\ which is an additional 
quotient of characteristic 1 in Q w . 

On the other hand, since w is cyclically reduced, every vertex in every 
T G Q w has degree > 2. Thus, if x(T) = 1, it is necessarily a cycle. As the 
path of w through T is non-backtracking and w ^ 1, it consists of tracing 
this cycle some d > 1 times. If d — 1, V is the universal graph. Otherwise, 
if we let u denote the word corresponding to a single traversal of the cycle, 
then w = u d . □ 

The contents of Lemmas fT2l and [TBI appear in different language in [BS87J , 
in |Nica94j and in [Fri03] . But the relation between (/>(■) and /?(■) goes deeper. 
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For instance, for the single-letter word w = a both (f>(w) = /3(w) = oo. The 
reason for <p(a) = oo is that the expected number of fixed points in a random 
permutation equals 1. On the other hand, Lemma [TOl implies that (3(a) = oo. 
Also, as already mentioned, both and (3 are invariant under " ~ ", so 
they are both infinite on the entire equivalence class of a under " ~ ". (In 
particular, every w in which some letter appears exactly once belongs to this 
class.) 

The following lemma expands even further the relation between </>(•) and 
/?(■). The next natural step would be to prove that <p(w) = 2 <^ (3{w) = 2. 
This, in other words, says that for primitive words, f3(w) > 3 iff 0,2(10) = 0. 
This is, at present, still beyond our reach and we content ourselves with a 
weaker statement. 

Lemma 14. Let w G have (3(w) > 3. Then a2(w) < 0. 

Proof. For simplicity we assume that w is cyclically reduced. (Again, this 
assumption is possible because both j3(w) and $ w (n) are invariant under 
cyclic reductions of w.) When f3(w) > 3, there is only one graph T G 
Q w of characteristic 1 (the universal graph), and all the quotient graphs of 
characteristic 2 have type A. To find the contribution of T to a 2 (w) expand 

the expression — i=1 .,- — to first order. It follows that this contribution 

is — ("r) +Y^ k j=i ^2) ' ^ e neec ^ t° show that there are at most ("j) — ^ ■ 1 ( e r) 
graphs T e Q w with x(T) = 2. 

Since T is generated by {s , s\ w \}, and every quotient T G Q w of charac- 
teristic 2 has type A, T is generated from f by a single pair of vertices of f\ 
The total number of pairs is y^), but clearly different pairs may generate 
the same T. For instance, for any two j-edges, the pair of heads generates 
the same quotient as the pair of tails. 

In order to understand the full picture, we introduce a graph T, which 
captures this kind of dependency between pairs. The graph T has m ver- 
tices labeled by the pairs of vertices of f , and has Y^j=i Cl ) e dges, one 
for each pair of same-color edges in T. The edge corresponding to the 
pair {ei,e 2 } of j-edges, is a j-edge from the vertex {head(ei), head(e 2 )} to 
{tail(ei), tail(e2)}. We illustrate this in Figure HI 

We claim that T has no cycles. As w is assumed to be cyclically reduced, 
T is simply a cycle and the path of w through it is a simple cycle. Now 
say that {xi, yi}, {x 2 , 1/2}, ■ ■ ■ , {x r , y r }, {xi, Vi} is a simple cycle in T, whose 
edges compose some word u. This u corresponds to two non-backtracking 
paths in V in one of the two following ways. Either u is a path from x\ to 
itself and a path from y\ to itself (whence it is some cyclic shift of w or of 
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{v,,v 2 } {v,,v,} {v,,v 5 } {v 3 ,v 7 } {v 3 ,v 4 } {v,,v 7 } {v 2 ,v 4 } 




Figure 4: The graph T (on the right) corresponding to the universal graph 
T (on the left) for w = g\gi9\9i9\^ 92- (f's vertices are denoted here by 
Vi, . . . ,v 7 while the Sj labels are omitted.) 



w 1 ). Or u is a path from x\ to y\ as well as a path from y\ to x\. (See 
Figure El) 




Figure 5: A cycle in T yields one of these three scenarios. The thick line 
stands for the universal graph T. The broken lines (arrows) stand for copies 
of u, the subword composed by the edges of the cycle in T. In the left and 
middle scenarios, u is a path from x\ to itself and a path from yi to itself 
(in the left graph the paths are equally oriented, in the middle one they have 
inconsistent orientation). The right graph describes the case where u is a 
path from X\ to y\ as well as a path from yi to X\. All three scenarios are 
impossible. 

In the former case, some cyclic shift of w equals another cyclic shift of w 
or of u> _1 . But w is primitive, so it is not invariant under any cyclic shift. 
To see that w cannot equal a cyclic shift of its inverse, we need to show that 
w~ l is not a subword of w 2 . To see this, let w — g\ . . . g m and assume to the 
contrary that Vj = 1, . . . , m g~ l = g s ~j for some m + 1 < s < 2m (here 
gi+m — g% for i = 1, . . . , m). Is s is even, say s = 2q, we conclude (for j = q) 
that g q = g^ 1 which is impossible. If s = 2q + 1 we conclude for j = q that 
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g^ 1 = g q+ i, so that the word w is not reduced. 

In the latter case, either u = which is impossible by the same argu- 
ment, or w is a cyclic shift of u 2 , which again contradicts its being primitive. 
This rules out the possibility of a cycle in T. 

Clearly, two pairs of vertices in V(r) which belong to the same connected 
component in T, generate the same T. Thus the number of different Ps in 
Q w with characteristic 2 is at most the number of connected components in 
T. But T has no cycles, so the number of connected components is exactly 

Note that this last proof yields a surjective function from the connected 
components of T to type- A graphs of characteristics 2 in Q w . In order to 
prove that <ft(w) = 2 f3{w) = 2 it suffices to show that this function is 
injective (which we believe is true). That would yield that the number of 
type-A quotients of characteristic 2 exactly balances off the negative contri- 
bution of T to d2{w). Thus, 02(10) 7^ (or more precisely 02(10) > 0) iff there 
is a type-B graph of characteristic 2, i.e. iff f3(w) = 2. 

In fact, we believe that something similar happens in general. Namely, 
for every integer % > 0, if Q w has no type-B graphs of characteristic < i, then 
the contributions from all the type-A graphs of characteristic < i. (i.e., the 
graphs generated from the universal graph by fewer than i pairs) balances 
out. Hence a (w) = a^w) = . . . = a^iiw) = 0, and Oi(w) equals the number 
of type-B quotients of characteristic i. 

There are three kinds of supporting evidence to this belief. As we saw, it 
is valid for i — 0, 1, and we have a good understanding why it should hold for 
i = 2 as well. Also, we have carried out extensive numerical simulations to 
test it for i — 2, 3, 4 without any failure. Finally, consider a word w such that 
w ~ a. Such a word has only type-A quotients, and we know that cij(u>) = 
for all i. In this case, therefore, the type-A quotients of characteristic < i 
indeed balance the contribution of each other to ao(w), . . . , di{w). (We note 
that the vanishing of all a,(w) even in this specialized case is not obvious). 

Here, then, is a formal statement of this main conjecture: 

Conjecture 15. For every w G F^, 

<j>(w) = 0(w) (10) 

Moreover, if (f>(w) = i, then 

Oi(w) = \{T E Q w : x( r ) = % and T has type B}\ (11) 
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Conjecture [15] is the main missing link in our tentative proof to the claim 
that E(/i maa .) < O(p). We return to this in Sectional Note that this conjec- 
ture implies that the first non-zero coefficient in the power series of <& w (ri) 
is positive. This means that E(Xw^) > 1 for every w G F k and every large 
enough n. 

Remark 16. It might be tempting to suspect something even stronger, namely 
that for every w G F k and n > 1, E(X^) > 1. However, this stronger 
assertion is false. (We would like to thank Miklos Abert for showing us the 
invalidity of this claim. The main ideas of the proof can be found in |Abej .) 

Finally, we present a second conjecture based on other simulations we 
conducted. These simulations suggest that the only words for which <f)(w) = 
oo are those mentioned above: 

Conjecture 17. E(xt ] ) = 1 iff w is equivalent to a single-letter word. 

3 The Largest New Eigenvalue in a Random 
Lift of a Graph 

In this section we apply our findings concerning formal words and word maps 
on S n to study the new eigenvalues in random lifts of graphs. Our main result 
is Theorem [1] which says that fi ma x < 0(X^ 3 p 2 ^ 3 ) almost surely. 

Recall the definition of 4>{w) for w G F k (Equation ([8])). Namely, Q w (n) = 
""^^(J) ^ ■ We first seek an improved estimate of the o(l) term in the 
numerator. 

Lemma 18. Let w G and let i > be some integer. Then: 

\{TeQ w : x (r) = *}|<M 2 * 

Proof. As we saw in Lemma El each F G Q w with characteristic i is generated 
by some set of i pairs. There are (^ w ^ 1 ) < \w\ 2 pairs to choose from, and the 
claim follows. □ 

Lemma 19. If <p(w) = i and if n > 3|w| 2 7 then 

1 / \w\ 2i+4 \ 

$ w (n) < - ai{w) + J-! . (12) 

rv \ n J 

Proof. As mentioned in the beginning of Section T2.31 ai(w), the coefficient 
of i in the power series of $ w (n), is completely determined by quotients in 
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Q w of characteristic < %. We now bound the contribution to a.i(w) of every 
such quotient. 

To this end, we analyze the contribution of T to $ w (n), as specified in flTj). 
For the sake of convenience, we let x equal -, and express this contribution 



as x 



e r -v r +i . nr=i (1-^) 

nLiiiilr^i-fa) 



-. For small x we can expand the fraction in this 



expression as a power series E^^o K xT - Write nj=i (1 — ^e) = 1 + Sr>i c 



x 



and n*=i n£i X (l - ^) = 1 + Yj r >i d rX r . Then: 
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1 and for r > 1, b r — c r — d r — &iG? r _i — ... — b r ^\d\. We have 
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Similarly, |d r | < A simple induction now shows that \b r \ < \w\ 2r : 
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Now consider the coefficient ai{w). There is at most one quotient in Q w 
of characteristic (Lemma [15]) which contributes at most \w\ 21 to a,i(w); 
There are at most \w\ 2 quotients of characteristic 1 which contribute at most 
21-2 each, etc. There are no quotients with characteristic greater then \w\, 



\w 



so we have at most contribution of \w\ of quotients of every characteristic 
< % < \w\. Thus a,i(w) < (\w\ + l)\w 
This yields the following: 



2 1 
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i=cj>(w) 
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(\w\ + l)\w\ 2 ^+ 2 n 
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n 



n — \w\ 



The lemma now follows because n > 3 1 w I 2 . 



□ 
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The set of possible values for (3{w) is {0, 1, ... , \w\} U {oo}, and we now 
split the sum over w G CV t (G) in according to (3{w). This yields: 

E(AO < £ [E(X(T))-1]= £ n-$>) 

weCV t {G) weCVt(G) 
ie{o,i,---,*}u{oo} toeCP t (G) 

The statement of (the unproved) Conjecture fT5l implies that the sum over 
w with (3{w) = oo vanishes, since (3{w) = oo yields 0(u>) = oo and hence 
= 0. We suspect that it should be possible to bound the number of 
words with f3{w) = i (Some results along these lines are proved in Section 
13.11) . This, combined with the statement of Conjecture[15]would have allowed 
us to bound the contribution of each < i < t to the above sum. 

This problem is still open, so instead we split the set CVt(G) into four 
parts: 

E (/W*) < T.weCV t (G) n ■ ®w(n) + T,wecv t (G) n • $«(n) + 
/3{w)=0 /3(w)=l 

Y.wecv t (G) n ■ $ w (n) + J2wecp t (G) n ■ $ w (n) (13) 

/3(w)=2 P(w)>3 

Using Lemmas fT2l and [TBI we can bound the value of § w {ri) when f3(w) = 
or 1. For these values f3{w ) = 0(w) and Lemma [T9l can be applied. If (3{w) = 
2, then 4>(w) > 2, and we can use (1T2"]) in its worst case, i.e. when (f>(w) = 2. 
Finally, if l3{w) > 3, the following lemma shows that & w (n) < O (A)- 

Lemma 20. Let w6F t have j3(w) > 3. Then 

1 / | w | 10 \ 
$ w (n) < -r a 3 {w) + 

n 6 \ n J 

Proof. The assumption (3{w) > 3 yields that a (w) = ai(w) = and that 
a2{w) < (Lemma fT4l) . Thus clearly Q w {n) < Yl^Lz a i( w ) > an d the claim 
is an immediate consequence of the analysis in Lemma fT9l (This is true since 
the proof of Lemma [[9] does not take full advantage of the assumption that 
(j){w) = i, but rather that dj(w) < for j < i.) □ 

To proceed with our analysis of Equation ({TBI , we now bound the number 
of words in CVt{G) with (3{w) — i for i — 0, 1, 2. 



24 



3.1 The Number of Words w G CV t (G) with Fixed j3(w) 

Our proof for this bound extends an idea that originated with |Buc86] and 
was later developed in [Fri03] . Recall that p denotes the spectral radius of 
T, the universal cover of the base graph G (as well as of any lift of G). Buck 
found a bound expressed in terms of p for the number of words in CVt{G) 
that reduce to 1. Friedman used a similar method to bound the number of 
imprimitive words in CVt{G). We further develop the method in order to 
bound the number of words in CVt(G) with f3{w) = 2. 

We present the three cases (i = 0, 1, 2) one by one. The case i = is 
indeed the simplest, and things get more complicated as i grows. (However, 
it does seem that a general bound can be proven for the number of words 
in CV t {G) for any fixed value of /?(•).) We first note that A T , the (infinite) 
adjacency matrix of T, is a self-adjoint bounded operator on the Hilbert 
space hiyiT)). Consequently, its operator norm equals its spectral radius, 
i.e. \\A T \\ = p. (The same argument shows that ||A^|| = p l for any integer 
I > 0). For every t>i,t>2 G V(T) the number of paths of length / from V\ to 
t> 2 is Aj,(vi, v 2 ), which can be bounded by \\A^\\ = p l . 

For every x G V(G), we arbitrarily choose some vertex v\ = V\(x) in the 
fiber of x in T. For every path 7 in G that starts at x, we consider the lift 
of 7 that starts at v\. We denote the tail of the lifted path by v 1 = v 1 (x). 

For i — 0, recall that (3(w) = w reduces to 1 (Lemma [T2l . Thus, 
every w G CVt{G) with j3(w) = corresponds to some path in G of length t 
which reduces to 1 (a nullhomotopic path). But these are exactly the paths 
which lift to closed paths in T as well. Thus: 

\{weCV t (G) : P(w) = 0}\ = A t(^i,v 1 )< 

xev(G) 

< Yl ft = \V{Cl)\ft (14) 

x£V(G) 

For i — 1 we want to count the number of imprimitive words in CVt{G). 
If w is imprimitive, then w = u d (equality in F&) for some u G and d > 2. 
Suppose that the path of the cyclically reduced form of u in G starts at 
x G V(G). Since the path of w visits x, there is some cyclic shift of w that 
starts at x. Thus, by adding a factor of \w\ = t to our eventual bound, we 
can assume w begins at x. 

We now let 7 be the path in G of the cyclically reduced form of u (a loop 
from x to itself). We divide the path of w through G to three parts: 

1 . a path homotopic to 7 of length l\ 
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2. a path homotopic to 7 of length l 2 

3. a path homotopic to r ) d ~ 2 of length Z 3 

with Zi + l 2 + h — t. 

These three parts lift to the following paths in T: 

1. a path from v\ to v 1 of length li 

2. a path from v\ to t> 7 of length l 2 

3. a path from v\ to ty-2 of length l 3 

Thus we can bound the total number of imprimitive words in CVt(G) as 
follows: 

\{weCV t (G) : p(w) = l}\< 

t 

<t-J2 E E E A4 1 (v 1 ,v^)A4 a (v 1 ,v^)A4 3 (vi,v^-2) 

d=2 rreV(G) 7 is a reduced 4-^2+^3=* 
loop from x to x 

(the initial factor of t accounts for the cyclic shift of w). 

Using the symmetry of A T , the inequality A^iy^v^-i) < p li and chang- 
ing the order of summations, we obtain: 

\{weCV t (G) : P(w) = l}\< 

t 

<*"EE E P h E A^(v 1 ,v,)Af(v 1 ,v 1 )< 

d=2 x&V{G)h+h+h=t v^eFib{x)\{vi}cV(T) 

<*(*-i) E E p^t i+ ' 2 (^^i) 

xev(G) h+h+h=t 

<*(*-i) E E 

= t(t- < |V(C)|tV (15) 

(The crux of the matter in this calculation is the second step which elim- 
inates the need to sum over closed paths 7). 

Next we bound the number of words in CVt(G) with f3(w) = 2. To 
this end we introduce a lemma that deals with quotient graphs of smallest 
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characteristic among all type-B quotients. Clearly, it is such quotients that 
determine j3(w). 

Let G be a graph, w G CVt(G) and T G Q w . Each label Sj that appears 
in a block that is associated with a vertex in T corresponds to a vertex in G. 
Therefore, a vertex in T may correspond to several vertices in G. We next 
show that for T as above each vertex in T corresponds to a single vertex from 
G. 

Lemma 21. Let G be a graph and w G CVt{G). If T G <2 W /ias i?/pe i? 
an<i = I3{w), then all labels that appear in a block from a vertex in T 
correspond to the same vertex in G. 

Proof. The proof proceeds by showing that otherwise there is another type- 
B quotient in Q w with smaller characteristic. Indeed, let {s^, . . . , s ir } be a 
block in T where the labels {sj i; . . . , Si k } (k < r) correspond to the vertex v in 
G, while the labels {si k+1 , . . . , Sj r } correspond to other vertices in G. Define 
the partition f by splitting this block to {s^, . . . , Sj fc } and {sj fc+1 , . . . , Sj r }. 
We claim that this is (i) a realizable quotient in Q w (ii) of characteristic 
x(r) — 1 and (iii) of type B as well. 

To see (i), recall that every letter in w corresponds to some edge in G. 
Since the two parts of the split block correspond to disjoint sets of vertices 
in G, there is no j such that both of them are heads (or tails) of a j-edge. 
Thus, the realizability of T yields the realizability of F. In deriving T from 
T we have increased the number of vertices by one with no additional edges, 
whence (ii) is proved. To show (iii), note that any generating set of T can 
be extended to a generating set of T by adding {s^, $i k+1 }, so that if f has 
type A, so does T. □ 

As we saw (Lemmas [S] and [H]) if w' is the cyclic reduction of w, then 
/3(w') = (3{w). Moreover, every type-B Y G Q w with x(r) = P(w) can be 
generated from some T' G Q w i with x(F') = x(r), through a series of co- 
operations (as in Lemma [9]). Since w' is cyclically reduced, every vertex in 
r" has degree > 2. (Indeed, T' is obtained from T by successive elimination 
of vertices of degree one in the graph). 

Now assume /3(w) = x(r) = 2. There are three possible shapes that T' 
can have: Figure-Eight, Barbell or Theta (see Figure E]). For a cost of an 
additional factor of t as above, we may assume the path of w through T begins 
at the vertex x specified in each of the diagrams. (More accurately, it begins 
at the vertex of V corresponding to x through the series of ^-operations.) By 
Lemma [HJ % corresponds to a unique vertex in G which we call x as well 
(by abuse of notation). This vertex x in G marks the starting point of w. 
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We now analyze each case separately, and using the notations in Figure El 
we trace the path of w' through V. 

Y 




4 



Figure-Eight Barbell Theta 

Figure 6: The three possible shapes of quotient graphs of a cyclically reduced 
word with characteristic 2. The edges 7, ( and r\ denote subwords. 

Assume first that T' has the shape of a Figure-Eight. In this case w' 
can be expressed using 7, ( in a reduced expression in which each of them 
appears at least twice (Lemma fTUl). For any fixed reduced expression in 7, (, 
we specify certain two appearances of 7 and certain two appearances of (. 
The path of w through T can be then divided to at most seven parts: four 
parts for the chosen appearances of 7 and (, and three parts for sequences 
of the rest of the expression (we can always choose the first two characters 
in the expression, but we may be forced to have spaces between the second 
and the third, between the third and the fourth and after the fourth). We 
then proceed to a calculation as in ffl5|) . 

To illustrate, let w' = 777C7~ 1 C~ 1 C~ 1 7- We split w' to seven parts as 
shown in the following bracketing: w' = (7)(7)(7)(C)(7 _1 )(C X )(C X 7)- The 
corresponding lengths are: l\ steps for the first 7, I2 for the second, /3 steps 
for the next 7 (which is considered "a space"), U steps for (, l 5 for the space 
7 -1 , Iq for £ _1 and l-j for the space C _1 7- We can now bound the total 
number of words which reduce cyclically (and with a possible cyclic shift) 
to this expression in some 7 and (. (The first factor of t in the calculation 
accounts for the initial cyclic shift of w.) 

\{w E CV t (G) : w reduces cyclically to 777C7~ 1 C~ 1 C~ 1 7 f° r some 7, (} \ 

(The second sum is over all 7 and ( - two distinct non-empty reduced loops 
from x to itself in G.) We proceed as before: 
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iGV(G) h+-+/7=i 

<*• E E p h+k+h p Zi+ V 4+ie < i^(G)|ty 

x<=V(G) h+...+l T =t 

This bound was calculated for a specific reduced expression in 7, £. The 
total number of possible expressions is less than 3*. (w.l.o.g every expression 
begins with 7, and it contains a total of between four and t components. For 
r components there are at most 3 r_1 possible continuations, and 3 3 + 3 4 + 
. . . + 3* _1 < 3*). Thus we can bound the total number of words in CV t (G) 
with f3(w) = 2 and which have a type-B Eight-Figure quotient graph, by 

|\/(G)|t 7 3y. 

For the Barbell and Theta the analysis is similar, but their contribu- 
tion is negligible relative to the contribution of the Figure-Eight. This time 
we construct a reduced expression in three subwords: 7, (, and rj, but the 
possible number of expressions is bounded by 2* (the same argument as 
above, only this time every subword has only two possible subsequent sub- 
words). We need to specify two occurrences of each of the three letters 
this time, so we may need to split the path of w' to 6 + 5 = 11 parts. 
The bound is therefore \V(G)\t 11 2 t p t , the asymptotic comparison is clearly 

|y(G)|t n 2y < |y(c)|t 7 3y. 

To illustrate the calculation, consider the following (reduced) expression 
for the Barbell: w' = 7?7CCC ? 7 _1 7 _l7 7C 7 7 _1 7- The number of words in CVt{G) 
which reduce cyclically to such an expression can be bounded by: 

< i-E.^cE,,...,!, A^tv^^x^A^^Mx^^My))- 

■A^, 4 ( Vl (y)Mv)) a t (vi (y)Mv)) a t K (z) Mx))- 

x V v v eFib(y) 

< t-Y, h ^ h P t - h ' k E E Af( Vl (x) : v v )A^(v v Mx)) 

x v v €V(T) 

< |v(G)|ty 

(Here x and y are vertices of G, 7 (resp. () is a reduced loop from x (resp. 
y) to itself, and rj a reduced path from x to y, and l\ + . . . + Is — t). 
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To sum up, we have for large enough t: 

\{w e CV t {G) : 0(w) = 2}| < (1 + o(l))|y(G)|t 7 3V (16) 

Remark 22. These calculations suggest that for every integer r > 2, the dom- 
inant figure among quotient graphs of characteristic r (after cyclic reduction) 
is a bouquet with r loops. Thus, for large enough t, the number of words in 
CV t (G) with (3{w) = r is less than (1 + o(l))\V(G)\t 4r - 1 (2r - 

Remark 23. Note that this counting argument is quite wasteful, and involves 
a good deal of overcounting. For instance, in the case i = 1, we counted 
each word of the form w = u A twice: once with the root u and d = 4, and 
once with the root u 2 and d = 2. It seems that in fact, we have bounded 
J2u>ecp t (G) ai(w). 

/3(w)=i 

3.2 The Proof of Theorem U 

We now have all the necessary ingredients for a proof of Theorem [TJ Recall 
that in (1131) . we split the set of words CVt{G) to four subsets according to 
the value of /?(•). In Table [2J we collect the following information for each 
subset: A bound on the subset's size and a bound on the value of & w (n) for 
the words in the subset. This table highlights the significance of the proved 
and unproved relations between </>(•) and /?(•) to the analysis of the sum in 
( |T3l) . The value of </>(•) yields the bounds in the right column of the table, 
whereas /?(•) is used to derive the bounds in the middle one. 

The number of words with f3{w) = was bounded in (j!4p . and the bound 
for $ w (n) is from Lemma HH] (since ao(w) = whenever j3{w) = 0). In (II 5p we 
bounded the number of imprimitive words in CVt(G), and Lemma yields 
again a bound for <& w (n) for imprimitive w. (By Lemma fT8l ai(w) < t 2 ). The 
number of words with (3(w) = 2 was bounded in ( Tl~6l) (for t large enough), 
and this time we bound Q w (n) using the fact that <f>(w) > 2 for every word in 
this subset. The size of the fourth set is bounded by the total size of CVt(G) 

\V(G)\ 

|CT> t (G)|=tr(V)= E A/<|V(G)|A/, 

i=i 

and the bound on <& w (n) in this case comes from Lemma [201 and the analysis 
of cti(w) in the proof of Lemma [T9l 

We now select t to minimize the resulting upper bound on E(/i ma J). It 
is not hard to see that the optimal t is G(logn). Consequently, all terms 
are o n (l) and can be ignored. The information in Table [2] is now 
combined with (Tl~3l) to yield [for t = 0(logn)]: 
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The Value of (3(w) 


The size of the subset < 


®w{n) < 



1 


\V{G)\-ff 
\V(G)\t*.p< 


l + - 

n \ n I 


2 


|V(G)|t 7 -3* -p* 


J *(t i + *) 
n z \ n I 


> 3 


\V(G)\-\f 


\{^ + ^\ 
n A \ n I 



Table 2: A bound for the size of each of the four subsets of CVt(G), and a 
bound for the value of <& w (n) for each word in the subset. 



EfrO < (l + °n(l)) 



n ■ \V{G)\ ■ p l + \V{G)\ ■t & -p t + \V{G)\ ■ t 11 ■ 3* • P - + \V{G)\ ■ 3t e 

n 



< (1 + o n (l))\V(G)\ ■ t u ■ 4 [max(p ■ n 1/4 , p, 3p • n" 1 / 4 , \ x ■ n- 2 '')] 



In order to balance the first and the last terms, we set n 



P 



-1/3^1/3 



(recall that t is an even integer, so we cannot guarantee exact equality here). 
Since t —>■ oo with n, the constant and polynomial factors can be replaced 
by (1 + o n (l))*. We obtain 



E(> r 



< 



A x 1/3 p 2/3 • (1 + o n (l)) • max (l, (£) ^ , 3 (£) ^ , 1 j 
A 1 1/ V /3 -(l+o„(l))-maxM,3^ ' 



(the equality holds because p < Ai is always true, see Remark [2]). 

The statement of Theorem [T] now follows from a standard application of 
Markov's inequality. Obviously, for every e > 0, 3 • \^ 3 p 2 / 3 + e may serve as 
an absolute upper bound. 

Remark 24. Here is a sketch of our proposed approach to Friedman's Con- 
jecture. Say we seek to prove that all new eigenvalues are, almost surely 

r 1 

0(p 1 ~ e X\) for every e > 0. To prove a bound of 0{p' :F + l \l +1 ) one would have 
to show that f3(w) = i <f){w) = i for every i < r. We believe that this 
can be shown in a way similar to our proof of Theorem [TJ In addition, it 
would be necessary to follow up on Remark |2"2"| and establish a bound on the 
number of words w G CV t (G) with given (3(w). 
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4 The Number of L-Cycles in w(s\, . . . , s&) 



In this section we introduce a new conceptual and relatively simple proof of 
a Theorem of A. Nica [Nica94] . In (J2J) we defined the random variable Xw 
which counts the number of fixed points in w(ai, . . . ,<Jk) f° r fixed w. We 
extend this concept and for every integer L > 1 denote by X^' L a random 
variable on which is defined by: 

X w,L( a ^ • • • , o-fc) = # of cycles of length L of w(a u ...,a k ) (17) 

(X^\ is a new notation for xffl)- 

Nica's theorem says that the variables X^\ have, for fixed w and L and 
for n — > oo, a limit distribution which can be computed explicitly. (Unless 
otherwise stated, the distribution on is always uniform.) 

Theorem 25. Let 1 / to e and suppose that w = u d , with u primitive. 
Then for every integer L > 1, the random variable X^\ defined in (IT) has, 
for n — > oo ; a limit distribution, which is given by: 

^ hZi/Lh (18) 

h£H{d,L) 

where H(d, L) is the set 

H (d, L) = {h > : h\ d and gcd(^-,L) = 1}, (19) 



~ Poi(m) (a variable with Poisson distribution with parameter m) , and 

'-5" denotes convergence in distribution. 

In particular, this limit distribution depends only on d and L ( and not on 

u). 

Note that in the case that w is primitive (i.e. d = 1), the limit distribution 
is simply Poisson with parameter 1/L. 

Our proof of Theorem [231 is based on the method of moments and provides, 
in particular, explicit expressions for the moments of X^\: 

Corollary 26. For every 1 ^ w G F^, L > 1 and r > 1 i/iere exists 
a rational function ^ w ,L,r that is defined on a neighborhood of such that 
E([X^ n jj r ) = ^ Wt L,r(^/n) for sufficiently large n. Consequently, the limit 

lim^oo E([X^] r ) exists and equals ^^(O). In addition, if w = u d , with 
u primitive, then this limit equals the corresponding moment of the sum in 
(HED- In particular, ^ W>L>1 (0) = lim n ^ 00 E(X^) = 
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(Note that this is a generalization of the function <& w (ri) defined in (j3j): 
n ■ $ w (n) + 1 = E(XS) = * W)1)1 (l/n).) 

The contents of Nica's work is Theorem and Corollary for the first 
moment. 

The main idea of the proof was already illustrated in Section [2} where 
we analyzed the expectation of X^\. Recall Equation (TjJ where $ w (n) is 
expressed as a power series Y^=o a i{ w )'h- We already know (Lemma [T2"j) 
that when w ^ 1, a (w) = 0, and so lim n _^ 00 E(X^) = ai(io) + 1. But this 
is exactly the number of graphs V e Q w with xCO = 1 (see, for instance, 
Lemma [T3"]) . 

If u> is cyclically reduced, then the only graphs in Q w with x = 1 are 
cycles (again, see the proof of Lemma [T3l) . Such a cycle consists of a cyclic 
concatenation of several copies of u (the primitive word such that w = u d ). 
The number of copies of u in the cycle has to divide d, hence 

lim E(x£\) = \H(d, 1)1 = # of divisors of d 

But we can indeed restrict our discussion to cyclically reduced words. 
The justification for this is the following. Let 1 / tu 6 F*, and let w' be 
its cyclic reduction. We have already mentioned that w ~ w' and so they 
induce the same distribution on S n . In particular, X^ L and X^) L are equally 
distributed. It is also quite evident that w and w' share an identical exponent 
of their primitive root (i.e., if w = xw'x' 1 for some igFj and w' = u d with 
u primitive, then xux~ l is primitive too, and w = (xux~ 1 ) d ). Thus the 
validity of Theorem [25] and Corollary [26] for w', yields their validity for w as 
well. 

Below, we extend this argument to obtain the limit of the expectation of 
[^i"i] r f° r ever Y L and r. We essentially use the same way we counted fixed 
points in w(ai, . . . , Ok) for all fc-tuples (ai, . . . , 07.) G 5„ fe , to count L-cycles 
and sequences of L-cycles, which we call lists of cycles. The point is that 
the total number of lists of length r of L-cycles, divided by (n\) k , equals the 
expectation of [X^ n ]j , the r-th factorial moment of X^\. Once we know 
how to calculate the limits of the factorial moments, the limits of the regular 
moments are at easy reach. To finish, we show that these limits equal the 
corresponding moments in the r.h.s of (118j) . and use the method of moments 
to conclude the proof. 



33 



4.1 Lists of Trails and their Categories 

We begin by generalizing some of the notions from Section 12. 1[ Let l^we 
Ffc be cyclically reduced, n > 1 an integer, sq G {1, . . . , n}, and ci, . . . , o~k G 
SVi- The iraz/ o/so through w(ai, . . . , cr fe ) is the sequence of images of so under 
iu(<7i, . . . , cr fc ). Namely, if w = g^g™ 2 . . . (with G {-1, 1}) in reduced 
form, then associated with s is the following path: 

a ai a a2 a az a . am 

H l 2 »3 'm 

so — > -Si — > s 2 — > . . . — > S m 

with si, . . . , s m G {1, . . . , n}, and s& = ^^(sb-i) (6 = 1, . . . , to). (Recall that 
we compose permutations from left to right.) 

Likewise, we can speak of the trail through some power of w. For example, 
the trail of s through w 3 (o"i, . . . , o&) is 

a ai a a2 a am ff " 1 ff ° 2 a a ™ a ai a a2 a a ™ 
SO > S\ > . . . > S m > S m +1 > ■ ■ ■ S2m ¥ s 2m+l > ■ ■ ■ > S3r, 

with Si, . . . , s m as before, and s m +i, ■ ■ ■ , s^m G {1, . . . , n} satisfying the ob- 
vious constraints. 

We recall that two trails were placed in the same category if they have the 
same coincidence pattern. This notion can be extended to our present, more 
general context, in an obvious way. Namely, every category is associated 
with some directed edge-colored graph. Moreover, we can define categories 
not only of single trails, but also of lists of trails, and again, associate a graph 
to each category. The nature of this graph is exactly as described in Section 

EH 

To illustrate, let w = gxgig^ L 5 , 2 1 be the commutator word, n > 8 an 
integer and o~i, <T2 G S n such that the following trails are realized by w(ai, 02) 
and u> 2 (<7i, cr 2 ), respectively: 



We denote the nodes of the trail through w by sj, . . . , s\ and the nodes 
through w 2 by Sq, . . . , s|. Then the graph associated with the category of 
this list of trails through w, w 2 is shown in Figure [71 

Although the notions here have a wider scope, we limit our discussion 
to categories of trails which represent cycles in w(ai, . . . , a^)\ an L-cycle is 
represented by a closed trail through w L (o~i, . . . , Ofc). In fact, we are interested 
in counting cycles of a given length, so for our purposes we can confine 
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strating point of 
the path through w 



strating point of 
the path through w 




Figure 7: An example of a graph associated with a certain category of a list 
of trails through w,w 2 , where w = gxgi9\ $2 i s the commutator word. 

ourselves to categories of a list of r trails through w L (ai, . . . , <7fc), for some 
L,r > 1. 

First, let us analyze the categories that represent a single L-cycle. A 
closed trail through w L (<Ti, . . . , cr^) represents an L-cycle only if it does not 
represent any smaller cycle. That is, in the closed trail 



s 



si 



S2r 



SLm—1 



SO, 



the L labels sq, s m , S2 



, S m , S2m, ■ ■ ■ , *(L— l)m 



must all be distinct. 



Once again, the graph of each category is a quotient of the universal 
graph. In this case, the universal graph is Cz,. m , the cycle of length L ■ m. 
A partition of its vertices corresponds to some category of an L-cycle if and 
only if it does not create "collisions" of same-color edges, and keeps apart all 
L vertices corresponding to sq, s m , S2 m , ■ ■ ■ , SjX-iw We demonstrate this in 
Figure 



The most general case in our discussion comes up when we turn to calcu- 
late the r-th moment of X, 

L-cycles through w(g\. 
r ordered cycles of length L each, consists of a disjoint union of r copies of 
Cl-tu- We name the vertices of the first cycle in T W) i,r by sj, . . . , si m _i, the 
vertices of the second cycle by Sq, . . . , s\ m _ x and so on until s r n sL_i for 



w L . To this end we consider categories of lists of r 
,<jfc). The universal graph Y w L r which represents 



5 o> 



'Lm-l 



the r-th cycle. 

We are interested in quotients (or partitions of the vertices) of T Wj l iT . that 
represent r distinct L-cycles. This means that the vertices 



s 1 s 1 



,1 



1 2 2 

S (I-l)m) S q, S m 



5 s (L-l)m) 



S 0? S mi 



) b (L-l)m 
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Figure 8: The universal graph of a 3-cycle when w = g\g29\ 9<2 ■ ^ n a 
realizable partition of the vertices that represents a 3-cycle, the 3 circled 
vertices (so, S4, sg) must belong to distinct blocks. 



(a total of L ■ r vertices) should be kept apart in every partition, as illustrated 
in Figure [91 

(n) 

4.2 Formulas for the Moments of X' „ T 

The method we use to calculate the expectation of [X^] r is quite similar to 
the one we used for E(X^ n j). We first calculate the "factorial moments" of 
X^\ and derive the regular moments from them (the r-th factorial moment 
of a random variable X is defined as E([X] r ), where [X] r is the "falling 
factorial" , namely [X\ r = X(X - 1) . . . (X - r + 1) ). 

■X-wL counts lists of r L-cycles in w(ai, . . . , cr&). As in the case of the 

L ' J r 

first moment, we calculate its expectation by counting the total number of 
lists of r L-cycles in w(a±, . . . , 0^) for all fc-tuples (ai, . . . , a^) G and 
dividing by (n\) k . 

The counting is carried out by classifying these lists into categories. Each 
list of r L-cycles with specified starting point for each cycle, belongs to some 
category. These categories are the quotients of the universal graph T Wt L jr , 
which we denote by Q Wt L,r (e.g., Q w ,i,i is the same set as Q w ). To recap, the 
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Figure 9: The universal graph 1^3 4 of four 3-cycles where w = g\g%gx 92 ■ 
In a realizable partition of the vertices that represents four 3-cycles, the 12 
circled vertices must belong to distinct blocks. (We omit the blocks corre- 
sponding to the rest of the vertices.) 

set Q w> L,r can be generated as follows: 

We first draw T w ^ r , the universal graph of r ordered L-cycles of w, which 
consists of r disjoint cycles each of which has L ■ \w\ vertices. Q w ,L,r consists 
of quotient graphs that are generated by partitions of the vertices of T w ^^ r . 
A quotient graph is included in Q w ^,r if it is realizable, and if in the corre- 
sponding partition each of the r • L vertices that represent the r L-cycles is 
in a different block. 

Let T be some graph in Q W) L,r- A realization of T is a fc-tuple of permu- 
tations 0j, . . . , <7fc G S n , a list of r L-cycles of 10(01, ... , &k) an d a specified 
starting point for each cycle, such that they belong to T's category. The 
number of realizations of T is the same as in (jSJ), namely: 

k 

N r (n) = n(n - 1) . . . (n - v r + 1) JJ (n - e{) ! (20) 

3=1 

Since every list of r L-cycles is counted U times (there are U ways to 
choose the starting points), we have: 
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E([*£]jr) = p. E [^iK-,^)] r = 

s4 E *w= (2D 



(n!l* L r 

1 ^-^ n ( n — 1) • • • — t>r + 1) 



^ r e ^ p n^n(n-l)...(n-4 + l) 

^ e n [^7 



Note that the equality between (12111 and ( 1221) holds only for n large 
enough. Indeed N r (n) = if n < e 3 r for some T G Q«j,L,r and j G {1, . . . , k}. 

This holds for n > maXj^\ y __^{ej(^ w> L > r)j- 

For every L > 1 and r > 1, ( |22i) thus yields a rational function in n, 
which, for sufficiently large n, is the r-th factorial moment of L . It is 
convenient to rewrite (1221) as a function of -: 

We can now define a rational function i/j W! L, r by: 

vw(*)=i e * x(r) ~ x fc n£ C (1 r te) ^ 

r£6io,L, r rii=i n*=i (i-te) 

and so E([X^] r ) = ^,L,r(^) for n > max j=1 ^ k (e^f w ,i,,r)) . 

The following lemma shows that ip w ^,r is well defined on a neighborhood 
of 0. 

Lemma 27. For each T G Q w ,L,r, X<T) > 1. 

Proof. Note that in 1^ ^ r every vertex has degree 2, since to is cyclically 
reduced. This degree can not decrease in a quotient. Therefore, every vertex 
in every T G Q w ,L,r has degree at least 2, and the lemma follows. □ 

It is well-known how to express the regular moments as linear combi- 
nations of the factorial moments. Thus we can derive (an efficiently com- 
putable) rational function ^ Wj L,r (which is a linear combination of i/j Wi l,i, ■ ■ ■ , i>n 
such that E([X^] r ) = ty wLr (l/n) for sufficiently large n. This function 
^w,L,r is obviously defined on a neighborhood of 0, which proves the first 
part of Corollary 1261 
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4.3 Proving Theorem 125] with the Method of Moments 



The proof of Theorem 1251 is based on the method of moments. Under certain 
mild conditions, a probability distribution is determined by its moments, or 
as here, a limit distribution is determined by the limits of the moments. 



4.3.1 Some Facts from the Method of Moments 

A probability measure \i on R is said to be determined by its moments if it has 
finite moments a r = x r fi(dx) of all orders, and /i is the only probability 
measure with these moments. We quote Theorem 30.2 from [Bil95j: 

Theorem 28. Let X and X n (n G N) be random variables, and suppose 
that the distribution of X is determined by its moments, that the X n have 
moments of all order, and that lim^oo E(X n r ) = E(X r ) for every r G N. 
Then 

X n ^ X. 

Note that if X and the X n are integer- valued then X n X is equivalent 
to Pr(X n = k) — > Pr(X = k) for every integer k. 

The relation between regular moments and factorial moments implies: 

Corollary 29. The statement of Theorem {MB holds where moments are re- 
placed by factorial moments. 

In this section we use Corollary [2H1 to prove Theorem [23 That X^ ] L has 
moments of all order is evident. We still need to show that the r.h.s of CE 



is determined by its moments, and that the r-th factorial moment of X^ L 
indeed converges to the r-th factorial moment of this random variable. 

Theorem 30.1 from [Bil95j provides a sufficient condition for a probability 
measure /i to be determined by its moments, namely, that the power series 
Yl r a rt r /r\ where a r is the r-th moment of \x has a positive radius of conver- 
gence. (This series is the moment generating function of /i, when the latter 
exists.) For fi a Poisson distribution (with any parameter), this power series 
converges for all real t (e.g., |Pit97j . section 4), so \i is determined by its 
moments. A convolution (a summation) of several Poisson distributions is 
itself Poisson (whose parameter is the sum of parameters), and thus satisfies 
the condition as well. 

In particular, recall that the r.h.s. of ( |T8l) is Y^h£H(d,L)hZi/Lh-> where 
Z m ~ Poi(m). If we omit the constant h of every term in this sum, we 
obtain Y,h^H(d,L) Z i/Lh, whose distribution is simply Poi{Y, h£H {d,L) 1 / Lh ), 
which is determined by its moments. According to the definition of H(d, L) in 
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(TH?|) . each h G H(d, L) satisfies 1 < h < d. Thus, if we denote by a r the r-th 
moment of Y.h&H(d,L) Z i/Lh and by (3 r the r-th moment of Y,h&H(d,L) hz i/Lh, 
then a r < p r < d r a r . Consequently, the series ^2 r /3 r t r /r\ has radius of con- 
vergence that is > ^ that of the series ^ r a r t r jr\. But the latter converges for 
all real t, hence so does /3 r t r /r\, and the distribution of J2heH(dL) hZi/Lh 
is determined by its moments. 



To conclude the proof of Theorem 1251 we need to show that the (factorial) 
moments of X^ L indeed converge to the respective moments of the r.h.s. of 

(TTgj) . But what is the limit of E([X^] r )? By Lemma [2H the limit of each 
term in the r.h.s. of (1231) is either (if x(T) > 2), or 1 (if x(r) = 1). 
Therefore, 

lim E([X™) r ) = j— | {r G Q w , L ,r : X(T) = 1}| (25) 

n^oo Li 

As explained in the proof of Lemma [23, the equality x(r) = 1 holds for 
some T G Q w ,L,r, iff every vertex in T has degree 2, i.e., iff T is a disjoint 
union of cycles. 

We denote by C w ^, r the subset of Q w ,L,r consisting of all graphs which 
are a disjoint union of cycles. (1251) now becomes: 

lim E([*£ B Jj P ) = ^\C w>L>r \ (26) 
4.3.2 The Graphs in C w>L>r 

Let w G Ffc be cyclically reduced and equal w d with u primitive and d > 1. 
A graph T G C Wi L jr has a very specific structure: Each cycle c in T must be 
a cyclic concatenation of several copies of u (every cycle in F looks like T u ^ t i 
for some positive integer b). 

To see this, recall that each cycle c in T represents a closed trail through 
w L (at least one closed trail). Hence there is some vertex x in c and some 
orientation on c, such that if we leave x in this orientation and go exactly d-L 
times through u, we get back to x (possibly after tracing c several times). 
Since u is primitive, it cannot be invariant under cyclic shift of any length 
/ < and the size of c must divide \u\. (In fact, the integer |c|/|u| divides 
d-L). 

Moreover, all closed trails that are represented in c, go in the same direc- 
tion (and thus also start in one of the |c|/|w| head vertices of u). We already 
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saw in the proof of Lemma HH that for u primitive, u~ x is not a subword of 
u 2 . This rules out the possibility of "finding u in the opposite direction". 

This analysis of the structure of the graphs in C Wi L,r yields an impor- 
tant corollary, which ultimately explains why the limit distribution of X^\ 
depends solely on d and L, and not on u: 

Corollary 30. Let W\ ) W2 G be cyclically reduced and equal uf and u$ , 
respectively, with ui and primitive and d > 1. Then 

\^wi,L,r\ | ,L,r | 

Proof. The analysis above shows that the inner structure of u is completely 
irrelevant to the graphs in C w ^, r , and there is a natural bijection between 
C Wl ,L,r and C W2) Ly. simply replace each copy of U\ by a copy of u 2 . □ 

4.3.3 The Simple Case of a Primitive Word 

The material in this section is not necessary for the proof and deals with 
the special case of primitive words. Our hope is that this spacial case makes 
it easier to follow the general proof. So let us assume that u = w and 
d — 1. The universal graph r„,£ jr is a disjoint union of r cycles, each of 
which consists of L copies of u. We have a total of r ■ L copies of u, and their 
r • L initial vertices are kept separated in each quotient. It is easily verified 
that the only quotient of T w ^,r which is a disjoint union of cycles is T w ^,r 
itself, and thus IC^^i^J = 1. By we have lim n _ i . QO E([X^] r ) = jj. 

On the other hand, when w is primitive, the r.h.s. of (fl8l) is simply Z\/l. 
Let X be an integer- valued non- negative random variable and let fx{t) be 
its generating function 

oo 

f x (t) = Y,Pr(X = j)V. 

3=0 

Under certain mild conditions and in particular if fx{t) is analytic on IR, 

E([X] r ) = /i r) (l) WreN. 
The generating function of Zul is f(t) = e^~ei . Thus, 

E([Z 1/L ] r ) = /M(l) = -i 
which completes the proof of Theorem [25] for w primitive. 
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4.3.4 The General Case 



To complete the proof of Theorem [25j we show that for every w = u d G , 
L > 1 and r > 1, the series E([X^] r ) converges, for n — > oo, to E([Y"d,L]r), 
where y diL is the r.h.s. of ([18]), i.e. F rfjL = Y^heH(d,L) hz i/Lh- 

Let X x and X 2 be non-negative integer-valued variables with generating 
functions fx 1 (t), fx 2 If ^ = X t + X 2 , then clearly K's generating function 
is fy(t) = fxi{t) ■ fx 2 (t)- The generating function of hZ m {h e N, Z m ~ 
Poi{m)) is fhz m (t) = e~ m • e m * . Therefore, 

= n e ~^ e ^ 

h£H(d,L) 

and if Lf(d, L) = {hi, . . . , /i p }, then 
E([V«] r ) = 0«)| t=1 

- e (jJlW^ML ("> 

ri +...+r p =r v 1 p/ j=l 

rj>0 

By ( 1261) . the series E([X^ n ]j r ) converges to ^ |Cuj,£,,r I ? so we proceed to 
analyze the set C w ^, r - Let T be a graph in C Wj L,r- As explained in section 
14.3.21 T is a disjoint union of cycles, each of which consists of several copies 
of u. If we let b denote the number of copies of u in some cycle, what are the 
possible values of bl To begin with, b\dL, as this cycle represents a closed 
path that consists of dL copies of u. Secondly, as this cycle represents an 
L-cycle, L\b and b \ dL' for any 1 < L' < L. If we let h — 4, then the 
constraints on b translate into the following conditions: h\d and (t,L) = 1, 
precisely as in the definition of H(d, L). Note that a cycle in T with b = Lh 
copies of u can represent up to h distinct L-cycles in w(ai, . . . , 0>). 

As before, let H(d, L) = {hi, . . . , h p }. For j = 1, . . . ,p consider those L- 
cycles which are associated in the quotient T to a cycle of length Lhj-\u\. (The 
z-th L-cycle belongs to the cycle c in T if the blocks containing s l Q , . . . , s^kd-i 
correspond to vertices in c.) Let rj be the number of such L-cycles whence 
Y2 r j — r i an d there are ( ri r r ) ways to choose which L-cycles go where 

(Recall that T Wj L, r consists of an ordered list of r L-cycles). Now let C^ Lr 
denote the subset of C W} L, r consisting of all quotient graphs where all disjoint 
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cycles are of equal length of Lh\u\ each. Then we have: 



ri+...+r p =r v y/ j=l 

rj >0 

S (rv.-rj II 1/7 1 C « 

n + ...+r- p =r \ 1 j=l 



:28i 



(C^ L0 denotes the singleton of the empty graph, and therefore |C^ i0 | = 1). 

By combining ( 1271) and ( 1281) , we conclude that Theorem [25] will follow if 
we show | 

^rl^w.vl = //i0 1/z ,fc(*)|t=i> ( 29 ) 

for every L > 1, r > and h G iL(<i, L). 



We begin with the l.h.s. of (1291) . Recall that by definition, each graph 
T £ Cjj ir consists of a disjoint union of cycles of length Lh\u\ each. Thus, 
each cycle represents up to h distinct L-cycles of w(ai, . . . , o^), and T can 
represent up to h ■ (# cycles in T) distinct L-cycles. But T represents only r 
distinct L-cycles, whence there are h ■ (# cycles in T) — r "free spots" in T 
that can contain new L-cycles. We illustrate these notions in Figure [TU1 

Now let a 1 ^ Lr [j] denote the number of graphs in L r with j free spots. 
We can define the generating function of the L r [j\- 

gt, L ,M = a UM + <l,,[i]* + <L,,[2]t 2 + • • • 

and obviously ^ (£>r (l) = \Ct, L ,r\- 

Before we derive a recursion formula for this function, we want to illus- 
trate by writing explicit expressions for r = 0,1,2. Every connected compo- 
nent (=cycle) in every T E C^ Lr , realizes at least one L-cycle. Thus, when 
r = and there are no L-cycles at all, we have only the empty graph which 
has no free spots, and so i (t) = 1. When r = 1, we have a single graph 
mC^ L1 , with h — 1 free spots (a single cycle of Lh copies of u), and therefore 
9w liW = ^ h ~ x ■ For r = 2 there is always a two-cycle graph with one L-cycle 
in each cycle. This graph has 2(h — 1) free spots. In addition, if h > 2, 
there are also graphs consisting of a single cycles that corresponds to two 
L-cycles. There are L(h — 1) ways to place the two L-cycles and such graphs 
have (h — 2) free spots. Thus, for h — 1, 9^12^) = t 2 ^ h ~^ and for h > 2 

gt, L , 2 (t)=L(h-l)t h ~ 2 + t 2 ^. 
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(s \... } u {si,..} 

Figure 10: One of the graphs T G C^ Lr , where w = u 3 for some primitive 
u E F k , h = 3, L = 2 and r = 3. (We use the labels s\ introduced in Section 
14.11 ) This T contains two cycles, each of which consists of six copies of u. 
Each of the cycles can correspond to up to three distinct L-cycles (indeed, 
h = 3), so T can contain up to six L-cycles. As it already contains three, it 
has 6 — 3 = 3 free spots. 

We now want to derive the functions L r by recursing on r. Let V be 
a graph in C\ ]Lr with j free spots. In what manners can we add another 
L-cycle and make it a graph in C^ Lr+1 7 We have two options: we can put 
the new L-cycle in one of the j free spots, in L possible ways (L possible 
cyclic shifts), resulting in j ■ L different graphs in Lr+1 , each of which has 
j — I free spots. Alternatively, we can add one new cycle to T and put there 
our new L-cycle, which yields a single graph in L r+1 with j + h — 1 free 
spots. Thus, we have: 

= L ■ {^LAt))' + ■ < L , r (t). 

We now go back to the right side of (1291) . Recall that fhZ 1/Lh (t) = 

1 i ( \ 1 £ 

e~TKeLh. If we write fhZy L) Sf) = ^~ TZ ^ TK • QL,r(t) where q^^it) is the appro- 
priate polynomial, then 

q h L ,o(t) = i 

and 

Thus g%, L r (t) = U ■ q\ r (t), and we can conclude: 

^KlA = \~ r ■ <L,,(1) = = fhZ^J 1 ) ( 30 ) 



{C- } 
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when the last equality comes from the fact that e LheLh 



t=i 



The proof of Theorem[25]is now complete. Corollary [26] is also proved. For 
completeness sake, here is a short proof of the last sentence in the corollary, 
namely, that lim n _ 00 E(X^) = |H( ^' L)I . 

Proof. 



hmE(X^) = E(y diL )=E( Yl hZ i/Lh\= E ( hZ VLh) 

h£H(d,L) J heH{d,L) 



h£H{d,L) 



1_ _ \H(d,L) 
Lh L 



□ 



Remark 31. In fact, the technique presented here are likely to yield further 
results. The method of moment applies as well to random vectors (for distri- 
butions that are determined by their moments, see, e.g., [JLROOj . Theorem 
6.2). The joint moments of X^ Li , . . . , X^l for some k and positive integers 
L\, . . . , L\~ can be analyzed similarly to the way we analyzed the moments of 
for some L, and the limit joint distribution of these variables is probably 
determined by its moments. 

Remark 32. It is of great interest to study word maps for other groups or for 
non-uniform distributions on S n . For results of this nature see |Ben06j . 

5 Open problems 

Many interesting questions and conjectures were raised in this paper. We 
collect them here. 

• Let u and w be two words such that for any finite group G, the dis- 
tribution of the two word maps on G are identical. Is it true that 
u ~ vol 

• (Conjecture [[5]) f3{w) = 4>{w) for every word w. 

• (A consequence of Conjecture [02 ) For every word w, and sufficiently 
large n, a random permutation in the image of w in S n has, on average, 
at least one fixed point. 



45 



• Friedman's Conjecture: For every base graph G, almost surely all new 
eigenvalues in lifts of G are < p + o(l). 

• Nica's theorem determines the behavior of the number of L-cycles in the 
S^-image of any formal word w. There are numerous other parameters 
of such permutations (e.g. the number of cycles) whose typical behavior 
is still not understood. 
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